AD-A218  868 


4 


OTIC  file  copy 


§ 


TR/90-01  \ 


4P0^R-TR*  9  0 


0  2  3? 


Intelligent  Real-Time  Problem-Solving: 
Issues,  Concepts  and  Research  Methodology 


January  25,  1990 


Prepared  by: 

Stanley  J.  Rosenschein 
Teleos  Research 

576  Middlefield  Road  Palo  Alto,  CA  94301 


With  contributions  from: 

Michael  Fehling  (Stanford  University) 
Matthew  L.  Ginsberg  (Stanford  University) 
Eric  J.  Horvitz  (Stanford  University) 
Bruce  D.  D’Ambrosio  (Oregon  State  University) 


Prepared  for: 

Air  Force  Office  of  Scientific  Research 
Bolling  Air  Force  Base 
DC  20332-6448 


Contract  Number  F49620-89-C-01 17 


PfeTHfflirriON  STATEMENT  a 

Approved  for  public  rdwN) 
Distribution  Uattmlted 


90  02  054 


SIFIED 


security  classification  of  this  page 


REPORT  DOCUMENTATION  PAGE 

Form  Approved 

OMB  No.  0704-01 88 

1  la.  REPORT  SECURITY  CLASSIFICATION 

1  UNCLASSIFIED 

1b.  RESTRICTIVE  MARKINGS 

2a.  SECURITY  CLASSIFICATION  AUTHORITY 


2b.  DECLASSIFICATION /  DOWNGRADING  SCHEDULE 


4.  PERFORMING  ORGANIZATION  REPORT  NUMBER(S) 

TR/90-01 


6a.  NAME  OF  PERFORMING  ORGANIZATION 


3  .  DISTRIBUTION /AVAILABILITY  OF  REPORT 
Approved  for  public  release  ; 
distribution  unlimited. 


S.  MONITORING  ORGANIZATION  REPORT  NUMBER(S) 


6b.  OFFICE  SYMBOL 
(If  applicable) 


TELEOS  RESEARCH 


6c  ADORESS  (City,  State,  and  ZIP  Code) 

576  MIDDLEFIELD  ROAD 
PALO  ALTO,  CA  94301 


8a.  NAME  OF  FUNDING /SPONSORING 
ORGANIZATION 


7b.  ADDRESS  tOty.  State,  and  ZIP  Code) 

Mio 

fcoV\ .  -  s  At  6,  kc.  ao's'ss.  -  cw  q  $r 


9.  PROCUREMENT  INSTRUMENT  IDENTIFICATION  NUMBER 


F49620-89-C-01 17 


10.  SOURCE  OF  FUNDING  NUM8ERS 


AFOSR 


8c  ADDRESS  (City,  State,  and  ZIP  Code) 
BUILDING  410 
BOLLING  AIR  FORCE  BASE 
WASHINGTON,  D.C.  20332-6448 


1 1 .  TITLE  (Include  Security  Clarification) 

INTELLIGENT  REAL-TIME  PROBLEM-SOLVING:  ISSUES,  CONCEPTS  AND  RESEARCH  METHODOLOGY  (U) 


WORK  UNIT 
ACCESSION  NO. 


12.  PERSONAL  AUTHOR(S) 

STANLEY  J.  ROSENSCHEIN 


13a.  TYPE  OF  REPORT  |1 3b.  TIME  COVERED 

CONTRACTOR  REPORT  :FINA|  FROM  AUG  89  TO  NOV  89 


16.  SUPPLEMENTARY  NOTATION 

STANLEY  J.  ROSENSCHEIN:  TELEPHONE  415/328-8800 


14.  DATE  OF  REPORT  (Year,  Month.  Day)  IS.  PAGE  COUNT 

90/01/25  37 


17. 

COSATI  COOES  “7 

FIELD  | 

|  GROUP  j 

1  SUB-GROUP  1. 

18.  SUBJECT  TERMS  ( Continue  on  reverse  If  necessary  and  identify  by  block  number) 
REALPTIME,  PROBLEM-SOLVING,  KNOWLEDGE-BASED  SYSTEMS,  , 
„ EMBEDDED  SYSTEMS,  INTELLIGENT  AGENTS.  Rpz:  /  --f €.  57>4 


l^t  ABSTRACT  (Continue  on  reverse  if  necessary  and  identify  by  block  number)  FX 

The  Air  Force  has  sponsored  a  study  aimed  at  laying'out  ^research  issues  in  the  area  of 
intelligent  real-time  problem-solving.  As  part  of  this  study,  a  team  led  by 
Dr.  Stanley  J.  Rosenschein  of  Teleos  Research,  has  reviewed  topics  in  this  area  and 
has  participated  in  a  workshop.  This  report  contains  a  position  statement  of  the 
Teleos  team  prepared  for  that  workshop,  along  with  a  discussion  of  the  research  issues 
panel  held  at  the  workshop  itsOlft  and  of  methodologies  for  evaluating  intelligent 
■real-time  problem-solving  systems* 


20.  DISTRIBUTION /AVAILABILITY  OF  ABSTRACT 
Iff  UNCLASSIFIED/UNLIMITED  □  SAME  AS  RPT. 


22a.  NAME  OF  RESPONSIBLE  INDIVIDUAL 


21.  ABSTRACT  SECURITY  CLASSIFICATION 
□  OTIC  USERS  UNCLASSIFIED 


122b.  TELEPHONE  (include  Area  Code)  I  22c.  OFFICE  SYMBOL 


OO  Form  1473,  JUN  86 


Previous  editions  are  obsolete. 


SECURITY  CLASSIFICATION  OF  THIS  PAGE 


Contents 


1  Introduction  3 

2  Team  Position  Statement  for  IRTPS  Workshop  5 

2.1  Background .  5 

2.2  General  Programmatic  Issues  .  6 

2.3  Historical/Interdisciplinary .  7 

2.4  Core  Research  on  Resource- Bounded  Reasoning .  7 

2.4.1  Base-level/Meta-level  Reasoning .  8 

2.4.2  Anytime  Reasoning .  8 

2.4.3  Compilation  (Compiled  vs.  Deliberative  Reasoning) .  9 

2.4.4  Pre-Emptive  Control .  9 

2.5  Experimental  Validation .  9 

3  Notes  on  Methodologies  for  Evaluating  IRTPS  Systems  11 

3.1  Introduction .  11 

3.2  A  Model  of  Embedded  Systems .  12 

3.3  Measurement  and  Utility .  12 

3.4  Evaluation .  14 

3.4.1  Analytical  Evaluation  Methods .  14 

3.4.2  Experimental  Evaluation  Methods .  15 

3.5  Other  Implications .  17 

3.5.1  Implications  for  Requirements  Specification  .  18 

3.6  Implications  for  an  Experimental  Testbed .  19 

4  Report  on  Research  Issues  Session  of  IRTPS  Workshop  20 


1 


4.1  Introduction .  20 

4.2  Candidate  IRTPS  Paradigms .  22 

4.2.1  Planning .  22 

4.2.2  Logics .  22 

4.2.3  Algebraic  Techniques .  23 

4.2.4  Decision  Theory . 23 

4.2.5  Control  Theory .  23 

4.2.6  State- Based  Automata  Methods .  24 

4.2.7  No-Paradigm .  24 

4.3  Toward  an  IRTPS  Research  Agenda .  25 

4.3.1  Research  Areas .  25 

4.3.2  Prioritization .  25 

4.4  Concluding  Remarks .  28 

5  Annotated  Bibliography  29 


Aeeasslon  for 


NTIS  GRAAI 
DTIC  TAB 
Unannounced 
Justlf loatic 

ear 

□ 

□ 

Rv 

Distribution/ 

Availability  Coda* 

Diet 

Avail 

Spao 

knd/or 

Lai 

2 


1 


% 


4.1  Introduction .  20 

4.2  Candidate  IRTPS  Paradigms .  22 

4.2.1  Planning .  22 

4.2.2  Logics .  22 

4.2.3  Algebraic  Techniques .  23 

4.2.4  Decision  Theory .  23 

4.2.5  Control  Theory .  23 

4.2.6  State- Based  Automata  Methods .  24 

4.2.7  No-Paradigm . 24 

4.3  Toward  an  IRTPS  Research  Agenda .  25 

4.3.1  Research  Areas .  25 

4.3.2  Prioritization .  25 

4.4  Concluding  Remarks .  28 


J 


Chapter  1 
Introduction 


Knowledge-based  systems  technology  has  led  to  practical  applications  in  a  number  of  areas, 
but  to  date  it  has  not  produced  adequate  techniques  for  dealing  with  systems  that  must 
interact  intelligently  with  their  environments  in  real  time.  To  address  this  need,  the  Air 
Force  has  launched  an  initiative  aimed  at  stimulating  a  national  research  effort  on  Intelligent 
Real-Time  Problem  Solving  (IRTPS). 

As  part  of  that  study,  a  team  of  five  researchers,  led  by  Dr.  Stanley  J.  Rosenschein  of 
Teleos  Research  with  the  assistance  of  Professor  Michael  Fehling  of  Stanford  University,  has 
investigated  a  variety  of  topics  and  has  formulated  positions  on  the  research  area.  This  work 
was  augmented  by  discussions  with  members  of  other  IRTPS  teams  and  colleagues  in  the 
research  community.  This  document  contains  the  final  report  of  the  Teleos  team. 

Before  describing  the  contents  of  the  report,  however,  let  us  review  the  activities  in  which 
team  members  engaged  as  part  of  the  IRTPS  project. 

•  Exchange  of  background  documents  and  comments  on  IRTPS. 

•  Discussions  prior  to  program  kick-off  meeting. 

•  Participation  in  one-day  kick-off  meeting  at  Cimflex  Teknowledge,  Palo  Alto. 

•  Several  meetings  between  S.  Rosenschein  and  M.  Fehling  to  discuss  research  issues. 

•  Written  contributions  by  all  team  members. 

•  Two  pre-workshop  meetings  to  discuss  and  finalize  position  paper. 

•  A  series  of  meetings  of  S.  Rosenschein  with  Lee  Erman  of  Cimflex  Teknowledge  and 
Barbara  Hayes-Roth  of  Stanford  University  to  discuss  research  methodology  and  ex¬ 
perimental  evaluation  methods. 

•  Written  contributions  by  S.  Rosenschein  leading  to  a  paper  on  methodologies  for  IRTPS 
research. 

•  Participation  in  IRTPS  workshop  in  Santa  Cruz. 
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•  Panel  on  Research  Issues  chaired  by  Teleos  team  members. 

•  Written  contributions  summarizing  discussion  of  Research  Issues  panel. 

•  Submission  to  Lee  Erman  of  written  Architecture  documents  and  Annotated  Bibliog¬ 
raphy. 

•  Preparation  of  Final  Report. 

This  Final  Report  contains  three  documents.  The  first  document  (Chapter  2)  is  the  Team 
Position  Statement  prepared  by  the  Teleos  team  for  the  IRTPS  workshop.  It  contains  a  dis¬ 
cussion  of  terms  and  issues  and  highlights  certain  recommendations  of  a  programmatic  nature 
regarding  the  management  of  the  national  IRTPS  effort.  The  second  document  (Chapter 
3),  co-authored  by  Dr.  Stanley  Rosenschein  of  Teleos,  Dr.  Barbara  Hayes- Roth  of  Stanford, 
and  Dr.  Lee  Erman  of  Cimflex  Teknowledge,  addresses  issues  of  research  methodology,  espe¬ 
cially  the  experimental  evaluation  of  the  performance  of  embedded  real-time  problem  solving 
systems.  It  states  desiderata  and  criteria  that  could  influence  the  design  of  an  experimental 
IRTPS  testbed.  The  third  document  (Chapter  4),  prepared  collaboratively  by  Teleos  team 
members,  summarizes  the  discussion  held  as  part  of  the  Research  Issues  panel  at  the  IRTPS 
workshop  in  Santa  Cruz.  These  documents  are  followed  by  an  IRTPS  bibliography. 
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Chapter  2 

Team  Position  Statement  for  IRTPS 
Workshop 


Over  the  last  decade  and  a  half,  advances  in  know  ledge- based  systems  technology  have  led 
to  practical  applications  in  a  variety  of  problem  domains.  Despite  these  advances,  current 
technology  remains  inadequate  for  dealing  with  systems  that  must  maintain  real-time  in¬ 
teractions  with  ongoing  processes  in  their  environment.  Because  systems  of  this  type  are 
critical  to  many  important  applications,  particularly  in  defense,  the  Air  Force  has  launched 
an  initiative  aimed  at  stimulating  the  development  of  a  national  research  effort  on  Intelligent 
Real-Time  Problems  Solving  (IRTPS). 

The  goal  of  the  first  phase  of  the  research  initiative  is  to  clarify  terms  and  issues  under¬ 
lying  intelligent  real-time  problem  solving,  and  as  part  of  this  effort  three  research  groups, 
including  the  Teleos  group  led  by  Stanley  Rosenschein,  have  been  chosen  to  investigate  these 
issues  to  help  guide  subsequent  phases  of  the  program.  Phase  1  is  to  culminate  in  a  workshop 
at  which  the  groups  compare  their  findings  and  discuss  issues  with  invited  researchers.  This 
document  contains  an  interim  report  of  the  Teleos  group  and  is  intended  to  serve  as  a  draft 
position  paper  for  the  IRTPS  Workshop.  It  contains  a  preliminary  review  of  terms  and  issues 
followed  by  a  discussion  of  implications  for  the  IRTPS  research  program  being  undertaken 
by  the  Air  Force. 


2.1  Background 

In  the  simplest  terms,  intelligent  real-time  problem-solving  systems  are  characterized  by  the 
following  features: 

1.  They  take  in  a  stream  of  sensory  input  from  the  environment. 

2.  They  produce  a  stream  of  control  output  that  affects  the  environment. 

3.  The  intervening  computation  is  modeled  as  a  reasoning  or  problem-solving  process. 
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4.  Time  matters. 


The  primary  challenge  in  developing  systems  of  this  type  lies  in  reconciling  the  conflict 
inherent  in  the  last  two  attributes.  In  conventional  knowledge-based  applications,  the  system 
is  intended  to  provide  support  to  human  problems  solvers,  where  the  humans  have  sole 
responsibility  for  real-time  interaction  with  the  environment,  and  the  system  is  not  required 
to  exhibit  real-time  performance.  For  example,  these  systems  are  typically  unable  to  adapt 
their  mode  of  operation  to  changing  deadlines  for  an  action  or  the  unpredicted  occurrence 
of  critical  events. 

The  central  programming  model  for  conventional  knowledge-based  systems  work  is  that 
of  an  inference  engine  that  performs  reasoning  steps  and  draws  conclusions  from  a  set  of 
domain-specific  rules  or  facts  stored  in  a  knowledge  base.  Because  chains  of  inference  leading 
to  conclusions  can  vary  greatly  in  length  and  draw  on  facts  in  the  knowledge  base  in  ways 
that  are  hard  to  predict  and  control,  it  is  difficult  to  bound  the  execution  time  of  programs 
in  this  model.  This  is  compounded  by  the  difficulty  of  controlling  the  performance  of  the 
operating  systems  environment  (e.g.,  list-processing)  in  which  most  knowledge-based  systems 
are  embedded.  In  fact,  much  of  the  appeal  of  the  knowledge- based  model  lies  precisely  in  the 
fact  that  it  abstracts  away  from  the  details  of  resource  allocation  required  to  support  inference 
and  allows  the  programmer  to  deal  primarily  with  the  content  of  the  inferences.  However, 
in  real-time  applications,  the  resource  question  cannot  be  ignored.  Without  considering 
execution  time,  rates  of  inference  cannot  be  related  to  rates  of  change  in  the  environment, 
and  the  designer  cannot  be  sure  that  the  system  will  be  able  to  find  satisfactory  output 
responses  in  a  timely  fashion. 


2.2  General  Programmatic  Issues 

We  see  the  IRTPS  program  as  addressing  the  need  for  real-time  knowledge- based  systems 
by  developing  three  mutually-supportive  research  thrusts: 

1.  Historical/Interdisciplinary 

2.  Core  Research  on  Resource- Bounded  Reasoning 

3.  Experimental  Validation 

The  allocation  of  funds  among  and  within  these  thrusts  should  be  at  least  partially  “bottom- 
up,”  i.e.,  it  should  be  responsive  to  the  best  ideas  offered  during  proposal  solicitation.  How¬ 
ever,  an  effort  should  be  made  to  maintain  balance  so  that  major  areas  are  not  entirely 
neglected. 

Because  of  the  limited  funding  allocated  to  the  main  research  effort  of  the  IRTPS  program 
($800,000  over  an  18- month  period),  a  realistic  objective  for  the  program  is  to  seed  research  in 
each  of  the  key  areas  and  to  establish  research  paradigms  and  activities  to  which  additional 
funding  can  be  attracted.  This  is  especially  true  for  the  experimental  component  of  the 
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program;  the  resources  required  to  develop  and  distribute  a  realistic  testbed,  for  instance, 
could  overwhelm  the  program’s  funding,  leaving  little  for  basic  research  unless  special  care 
is  taken  to  leverage  existing  software  and  other  government  programs. 

In  the  following  sections,  we  describe  each  of  the  proposed  program  thrusts  and  list 
several  research  questions  that  might  be  explored  in  each.  These  questions  are  meant  to  be 
suggestive  rather  than  exhaustive,  and  will  undoubtedly  be  supplemented  as  the  research 
proceeds. 


2.3  Historical/Interdisciplinary 

The  current  IRTPS  program  does  not  exist  in  a  vacuum.  IRTPS  systems  have  been  built 
using  existing  artificial-intelligence  (AI)  concepts  and  tools,  and  these  should  be  investigated 
with  a  view  toward  drawing  out  the  lessons  to  be  learned.  Among  the  approaches  that  might 
be  taken  to  these  investigations  are  case  studies,  literature  searches,  and  attempts  to  classify 
existing  systems  in  terms  of  the  conceptual  categories  developed  in  the  IRTPS  program. 

In  addition,  artificial  intelligence  is  not  the  only  discipline  to  be  concerned  with  embedded 
real-time  computation  or  intelligent  problem  solving.  Work  in  decision  theory  and  control 
theory,  in  real-time  operating  systems  and  scheduling  algorithms,  and  in  computer-based 
control  systems  has  resulted  in  a  large  body  of  practice,  theory,  and  engineering  methodol¬ 
ogy.  Research  in  these  disciplines  should  be  explored  in  light  of  IRTPS  requirements  and 
objectives.  In  particular,  methods  should  be  developed  that  will  allow  non-AI  approaches 
to  be  adapted  and  applied  to  IRTPS  problems  and  to  coexist  with  specialized  techniques 
developed  in  the  A I  framework. 


2.4  Core  Research  on  Resource- Bounded  Reasoning 

This  thrust  should  be  aimed  at  developing  a  deeper  understanding  of  the  tradeoffs  inherent 
in  reasoning  under  time  stress.  Constraints  on  a  real-time  reasoning  system’s  inference  and 
representation  lead  to  inescapable  uncertainties  about  the  problems  that  may  be  faced.  A 
real-time  system  immersed  in  a  complex  world  must  grapple  with  uncertainty  associated 
with  both  the  environment  (object-level)  and  the  reasoner  (inference  level).  In  addition, 
agents  must  typically  contend  with  deep  uncertainty  about  the  value  of  future  reasoning. 
However  the  benefits  of  controlling  computational  tradeoffs  in  theoretically  coherent  ways  are 
very  great  in  high-stakes  decision-making  arenas  such  as  medicine,  aerospace,  and  defense, 
and  justify  an  intensive  research  effort  in  this  area.  An  important  part  of  this  research 
will  involve  a  synthesis  of  logical  models  of  reasoning  with  Bayesian  and,  more  generally, 
decision-theoretic  models. 
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2.4.1  Base- level/ Meta- level  Reasoning 

One  set  of  research  issues  focuses  on  the  control  of  search  as  a  method  for  bounding  execu¬ 
tion  time  of  reasoning  processes.  Reasoning  is  modeled  as  a  search  process  in  which  many 
potential  branches  are  available  for  exploration  and  in  which  choices  are  made  about  where 
efTort  should  be  allocated.  The  computation  is  broken  into  elementary  bounded-time  steps 
(with  different  approaches  varying  in  the  grain  size  they  consider  for  these  elementary  units), 
and  attention  is  shifted  under  the  control  of  a  higher-level  process  whose  time  behavior  is 
well  understood.  This  method  of  resource  allocation  similar  to  that  used  in  a  multi-tasking 
operating  systems,  and,  as  in  the  case  of  operating  systems,  interruptability,  pre-emption, 
and  prioritization  are  the  key  terms  of  analysis.  Unlike  operating  systems,  however,  issues 
of  the  content  of  the  computation  as  a  reasoning  process  need  to  be  more  fully  modeled  and 
related  to  the  resource-allocation  algorithm. 

Considerations  of  resource  allocation  lead  directly  to  questions  regarding  the  criteria 
by  which  allocation  is  to  be  judged.  One  approach  (metalevel  control  of  reasoning)  is  to 
formulate  explicit  theories  about  the  reasoning  process  and  the  effect  of  alternative  control 
strategies.  Research  is  needed  on  methods  for  allocating  resources  between  base-level  and 
meta-level  rcasoners.  An  important  goal  of  this  research  is  to  discover  ways  of  limiting  the 
amount  of  time  spent  doing  meta-level  reasoning,  or  more  generally,  to  optimize  the  split  of 
resources  between  base-  and  meta-level  reasoning.  An  attempt  should  be  made  to  identify 
classes  of  tractable,  closed-form,  metalevel  control  problems. 

For  many  applications  it  wil  be  important  to  quantify  ( 1 )  the  value  or  cost  of  achieving  (or 
not  achieving)  goals,  (2)  uncertainty  about  the  existence  of  alternative  states  of  the  world, 
and  (3)  the  costs  of  continuing  to  deliberate  (versus  taking  an  action).  Decision  theory 
provides  a  useful  framework  for  the  design  and  evaluation  of  real-time  systems  as  it  gives  us 
a  language  and  precise  semantics  for  capturing  preferences  and  uncertainty. 

2.4.2  Anytime  Reasoning 

Algorithms  that  can  produce  partial  or  reduced-quality  output  when  their  execution  is  ter¬ 
minated  before  some  complete  soluction  is  produced  have  been  called  anytime  algorithms. 
Useful  properties  that  such  algorithms  can  exhibit  include  monoticity  (improvement  over 
time),  continuity  (gradual  improvement),  and  convergence  in  the  limit.  We  see  a  need  to 
extend  this  line  of  research  to  include  inference  processes,  so  that  so  that  declarative  compu¬ 
tations  can  be  interrupted  before  they  have  finished  running  and  still  produce  useful  results. 

Among  the  approaches  that  can  be  tried  are  the  following: 

1.  Come  up  with  an  anytime  inference  algorithm  that  gradually  and  uniformly  approaches 
the  right  answer. 

2.  Come  up  with  an  anytime  inference  algorithm  that  approaches  the  right  answer  in 
the  large-runtime  limit,  but  might  wander  around  before  doing  so.  (Presumably,  the 
“quality”  of  the  answer  would  increase  uniformly,  in  some  strange  sense.) 
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If  the  second  approach  becomes  necessary,  one  important  question  will  be  how  to  adapt 
the  reasoning  process,  and  the  strategies  that  guide  it,  to  changes  in  the  environment  that  in¬ 
validate  previous  input  or  modify  the  available  problem  solving  resources,  Truth-maintenance 
techniques  will  be  important  here,  though  research  will  be  required  to  adapt  these  techniques 
to  the  real-time  setting.  A  related  set  of  research  topics  involve  modeling  anytime  proba¬ 
bilistic  and  decision-theoretic  reasoning. 


2.4.3  Compilation  (Compiled  vs.  Deliberative  Reasoning) 

So  far,  we  have  only  discused  deliberative  approaches  to  reasoning  and  metareasoning.  It 
can  be  important  to  reduce  complex  deliberation  in  computer-based  reasoners  by  developing 
decision-making  techniques  that  rely  to  some  extent  on  precomputed  or  compiled  responses. 
Such  knowledge  can  be  generated  at  design  time  or  learned  by  agents  over  their  lifetimes. 

Recent  research,  including  that  labeled  reactive  planning,  has  centered  on  the  replacement 
of  unwieldy  solution  mechanisms  and  detailed  representations  of  knowledge  with  compiled 
situation-action  rules.  Such  rules  enable  agents  to  respond  immediately  perceptual  inputs. 
Investigators  have  sensed  that,  for  many  contexts,  explicit  representations  and  deliberation 
will  not  be  necessary  for  good  performance. 

Deliberation  and  reaction  are  merely  two  ends  of  a  spectrum  with  many  intermediate 
points.  One  imporant  topic  of  research  involves  developing  methods  for  optimiz.  ig  the  split 
between  deliberation  and  reaction  in  the  design  of  embedded  agents. 

2.4.4  Pre-Emptive  Control 

This  sub-area  should  explore  pre-emptive  control  strategies  for  multiple,  simultaneous 
problem-solving  activities,  with  a  distinction  being  drawn  between  pre-emption  and  mul¬ 
titask  management.  For  example,  pre-emption  can  occur  in  managing  even  a  single  line  of 
control.  The  policies  that  guide  pre-emption,  and  the  systems  architecture  that  facilitate 
the  implementation  of  these  policies,  need  to  be  characterized  and  studied. 


2.5  Experimental  Validation 

While  abstract  conceptual  models  of  real-time  reasoning  are  an  important  first  step  toward 
the  design  of  practical  IRTPS  applications,  these  models  must  be  augmented  by  a  software- 
development  methodology  that  designers  can  actually  use  to  solve  real-world  problems.  The 
methodology  should  help  the  programmer  instantiate  the  general  model  to  the  particular  data 
structures  and  operations  required  to  satisfy  the  requirements  of  his  particular  application 
problem.  The  methodology  includes  specialized  software-development  tools  that  capture 
key  abstractions,  hide  implementation  details,  but  leave  the  programmer  with  sufficient 
flexibility  and  control  over  what  is  important.  One  goal  of  the  IRTPS  research  program 
should  be  to  enumerate  and  taxonomize  programming  methodologies,  architectures,  and 
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tools,  relating  them  to  the  underlying  conceptual  model  they  support,  and  exploring  whether 
the  abstractions  they  provide  can  be  made  orthogonal  and  be  incorporated  into  a  more 
embracing  IRTPS  software  methodology. 

One  of  the  goals  of  the  IRTPS  program  is  to  provide  a  methodology  for  exploring  archi¬ 
tectures  for  real-time  problem  solving  in  an  empirical  setting.  One  way  of  promoting  this  goal 
is  by  establishing  a  common  conceptual  framework  to  guide  experimental  work  in  the  IRTPS 
community.  Program  management  has  established  an  Experimental  Methodologies  Working 
Group  which  has  outlined  such  a  framework  and  produced  a  draft  document  describing  its 
findings.  The  document  describes  ways  that  models  of  intelligent  embedded  systems  might 
be  modeled  and  evaluated,  focusing  specifically  on  comparing  the  effectivesness  of  agents 
with  different  compositions  and  abilities  immersed  in  distinct  problem  contexts.  Evaluation 
methods  include  theoretical  analysis  as  well  as  experimental  methods,  both  in  simulated  and 
real  environments.  The  document  goes  on  to  describe  what  kind  of  controls  would  be  nec¬ 
essary  to  make  the  results  of  experimental  methods  empirically  meaningful  and  proposes  a 
variety  of  measurement  types  relevant  to  real-time  problem  solving.  Our  team  has  reviewed 
a  preliminary  draft  of  this  document  and,  in  general  terms,  is  in  agreement  with  its  analysis 
and  recommendations. 

A  second,  more  concrete,  way  of  enhancing  the  field’s  experimental  methodology  is  to 
provide  a  common  testbed  that  might  be  used  by  the  IRTPS  community  to  carry  out  empir¬ 
ical  investigations.  Such  a  testbed  would  allow  precise  measurements  of  the  performance  of 
systems  and  architectures.  We  feel  such  a  testbed  would  be  a  useful  component  of  the  IRTPS 
program,  provided  it  can  be  provided  at  reasonable  cost  and  can  be  configured  to  adhere  to 
the  methodological  guidelines  proposed  in  the  Experimental  Methodologies  Working  Group 
document.  One  method  of  leveraging  the  program’s  research  funds  to  good  advantage  would 
be  to  identify  existing  testbeds  produced  under  other  government  programs  that  could  be 
adapted  to  support  experimental  work  in  IRTPS.  Consideration  should  be  given  to  dimen¬ 
sions  of  variability  among  application  domains,  for  example  discrete  vs.  continuous  domains, 
low-level  vs.  high-level  perceptual  data,  and  so  on. 


10 


Chapter  3 

Notes  on  Methodologies  for 
Evaluating  IRTPS  Systems 


With  Dr.  Barbara  Hayes-Roth  (Stanford  University)  and  Dr.  Lee  Erman  (Cimflex  Teknowl- 
edge) 

We  propose  a  framework  for  modeling  intelligent  real-time  problem-solving  systems  em¬ 
bedded  in  an  environment.  Within  this  framework,  measurements  may  be  defined  on  the 
system  and  on  the  environment,  and  particular  measurements  may  be  designated  for  judg¬ 
ing  the  performance  of  the  system.  Although  this  framework  supports  analytical  evaluation, 
we  concentrate  on  its  use  for  experimental  evaluation,  especially  for  evaluating  and  com¬ 
paring  system  architectures.  This  framework  also  provides  a  basis  for  formalizing  various 
requirements  terms,  such  as  “reactivity”  and  “graceful  degradation”. 


3.1  Introduction 

Intelligent  real-time  problem-solving  systems  are  embedded  computer  systems  that  interact 
with  their  environments  in  a  continuous  fashion,  sensing  asynchronous  events  and  acting  in 
ways  designed  to  satisfy  certain  goals.  Instances  of  such  systems  include  intelligent  robots, 
factory  control  systems,  avionic  systems,  and  medical  monitoring  systems.  Many  software 
architectures  have  been  proposed  to  ease  the  design  and  implementation  of  effective  IRTPSs, 
and  there  has  arisen  the  need  for  some  objective  means  of  evaluating  and  comparing  them. 
As  part  of  the  research  program  on  Intelligent  Real-Time  Problem  Solving  being  sponsored 
by  the  Air  Force,  a  small  working  group  was  formed  to  consider  the  methodologies  for 
experimentally  evaluating  IRTPS  architectures.  This  document  is  a  preliminary  report  of 
that  working  group. 
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3.2  A  Model  of  Embedded  Systems 


The  first  step  in  developing  an  evaluation  methodology  for  IRTPSs  is  to  lay  out  a  conceptual 
framework  for  modeling  systems  and  their  interactions  with  the  environment. 

A  natural  beginning  point  is  with  models  of  dynamic  physical  systems.  Such  systems  can 
be  described  in  terms  of  time  series  of  physical  states,  for  example  as  mappings  (possibly 
stochastic)  from  instants  of  time  to  some  state  space  of  values  that  model  the  physical 
features  of  interest.  One  then  partitions  the  overall  physical  system  into  sub-components 
corresponding  to  the  IRTPS  S  and  the  environment  E ,  each  having  dynamic  local  state  that 
varies  as  a  function  of  signals  received  from  the  other.  We  may  wish  to  regard  the  IRTPS 
and  its  environment  as  being  parametrized  in  various  ways.  Let  S(u)  and  E(v)  represent 
the  system  and  its  environment  with  parameters  u  and  v  respectively.  In  addition,  either  or 
both  of  S  and  E  could  have  random  elements. 

Because  S  and  E  are  dynamic,  time-varying  objects,  we  are  usually  interested  in  de¬ 
scribing  not  only  those  properties  that  hold  statically  of  individual  states,  but  properties  of 
the  time  series  of  states.  For  present  purposes,  we  will  call  these  time  series  runs  and  write 
runs((S,  E))  to  represent  the  set  of  runs  of  the  combined  IRTPS/environment  pair.  Each 
run  is  (conceptually)  a  sequence  of  total  system  states  (i.e.,  states  of  the  IRTPS/environ¬ 
ment  aggregate.)  Conceptually,  a  run  could  be  infinite  and  there  could  be  an  infinite  number 
of  runs. 

Under  this  very  general  model,  the  boundary  between  S  and  E  is  arbitrary  and  can  be 
adjusted  to  address  different  goals.  In  the  present  context,  we  are  concerned  with  evaluating 
proposed  IRTPS  architectures  with  respect  to  their  performance  in  particular  classes  of 
environments.  Thus,  the  S  —  E  boundary  should  be  placed  so  as  to  distinguish  between 
a  proposed  architecture  and  the  environment  in  which  it  is  claimed  to  be  effective.  Then 
we  can  evaluate  the  relationships  between  properties  of  the  architecture  as  manifested  in 
S  and  properties  of  E.  In  particular,  note  that  the  S  —  E  boundary  need  not  correspond 
to  the  boundary  between  a  “complete  agent”  and  its  environment,  but  may  correspond  to 
the  boundary  around  any  “partial  agent”  of  interest.  For  example,  to  evaluate  a  complete 
agent  architecture,  the  S  —  E  boundary  should  encompass  all  perception,  reasoning,  and 
action  elements.  But  to  evaluate  a  perception  architecture,  the  S  —  E  boundary  should  more 
tightly  encompass  only  perceptual  elements,  with  any  reasoning  or  action  elements  treated 
as  part  of  the  environment.  We  might  often  choose  to  treat  particular  sensors  and  effectors 
as  elements  of  E.  As  discussed  below,  for  a  given  placement  of  the  S  —  E  boundary,  we 
will  be  attempting  to  attribute  properties  in  the  environment  to  the  behavior  of  particular 
IRTPSs  and,  by  inference,  to  their  underlying  architectures. 


3.3  Measurement  and  Utility 

In  order  to  describe  the  effectiveness  of  an  IRTPS  architecture  in  controlling  aspects  of  the 
environment,  it  is  necessary  to  identify  measurements  that  can  be  made  and  how  those 
measurements  will  be  interpreted  to  determine  utility. 
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A  measurement  is  any  function  of  state  values.  Measurements  can  be  made  within  a 
state  or  over  sets  of  states  or  runs. 

Under  the  above  model  of  embedded  systems,  for  a  given  S  —  E  boundary,  measurements 
on  E  are  distinguished  from  measurements  on  S.  Measurements  on  E  describe  the  dynamic 
properties  of  the  environment,  some  of  which  are  determined  by  processes  internal  to  the 
environment  and  others  of  which  are  influenced  by  the  behavior  of  S  in  E.  The  latter  sorts 
of  measurements  are  distinguished  and  used  to  assess  the  effectiveness  of  5  in  determining 
properties  of  E  under  various  conditions.  These  assessments  may  be  in  absolute  terms  or 
relative  to  alternative  5s.  Measurements  on  5  describe  the  dynamic  properties  of  the  IRTPS, 
some  of  which  are  determined  by  processes  internal  to  it  and  others  of  which  are  determined 
by  the  impact  of  E  on  5.  These  measurements  are  used  help  to  explain  the  performance  of  5 
and  its  (in)effectiveness  in  determining  properties  of  E  in  terms  of  its  underlying  architecture. 
They  also  are  used  to  analyze  and  predict  its  performance  under  other  values  of  u  and  v. 

Two  classes  of  measurements  are  distinguished  -  descriptive  and  utility.  Descriptive 
measurements  represent  objective  features  of  a  state,  run,  or  set  of  runs.  Examples  are:  (a) 
deadline  d  was  met;  and  (b)  80%  of  deadlines  of  type  t  were  met.  Other  illustrative  simple 
descriptive  measurements  are: 

•  Latency  from  environmental  event  Cj  to  environmental  event  e2  e.g.,  latency  from 
occurrence  of  a  fault  to  occurrence  of  its  correction 

•  Deadline  satisfaction  e.g.,  whether  a  given  fault  is  corrected  by  the  time  of  its  deadline 

•  Logical  correctness  of  result 

•  Quality  of  result 

•  Precision  of  result 

Illustrative  functions  on  measurements  are: 

•  Average  latency  for  critical  events 

•  Percent  deadline  satisfactions  for  critical  events 

•  Sum  of  latencies  for  all  instances  of  event-type-a  to  event-type-b 

•  Average  percentage  over  deadline  on  soft-deadline  events 

Utility  measurements  represent  valuational  conclusions  based  on  the  features  or  qualities 
of  a  state,  run,  or  set  of  runs.  An  example  is:  satisfactory  performance  requires  meeting 
>  95%  of  priority  1  deadlines  and  >  50%  of  priority  2  deadlines.  Other  illustrative  utility 
measurement  (higher  is  better)  are: 

•  Weighted  sum  of  importance  x  deadline  satisfied  (0  or  1)  for  all  events 
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•  Weighted  average  of  response  quality  x  importance  x  deadline  satisfied 

•  Gracefulness  of  degradation  (suitably  formalized) 

The  choice  of  utility  measures  may  be  specific  to  the  S  —  E  boundary  placement.  They 
certainly  will  be  specific  to  the  purpose  of  the  evaluation. 

To  describe  system  Si(u)  parametrically  with  respect  to  various  measurements  of  utility 
against  fixed  (parametric)  environment  E{v)  amounts  to  characterizing  the  expected  utility 
of  Si(u)  as  a  function  of  (u,v),  using  a  variety  of  techniques,  some  of  which  are  described 
below.  Similar  methods  can  be  used  to  compare  two  similarly  parametrized  systems,  Si(u) 
and  52(u),  fixed  (parametric)  environment  E(v). 


3.4  Evaluation 

In  principle,  there  are  many  ways  a  proposed  IRTPS  design  might  be  evaluated.  The  methods 
fall  into  two  general  categories:  analytical  and  experimental.  Because  each  of  these  methods 
has  its  advantages  and  drawbacks,  a  thorough  evaluation  often  requires  both.  In  the  first 
section  below,  we  briefly  mention  some  of  the  advantages  and  disadvantages  of  analytical 
methods.  The  following  section  treats  experimental  methods,  which  are  the  focus  of  this 
document,  in  more  detail. 


3.4.1  Analytical  Evaluation  Methods 

One  method  of  characterizing  system  performance  is  by  establishing  certain  of  its  properties 
through  analytical  techniques.  These  techniques  draw  on  relevant  mathematical  methods 
and  amount  to  proving  theorems. 

The  main  advantage  of  the  analytical  approach  is  the  ability  to  establish  with  mathe¬ 
matical  certainty  very  general  or  universal  statements  about  entire  classes  of  behaviors  and 
phenomena  far  too  numerous  to  enumerate  in  explicit  detail.  For  instance,  it  might  be  shown 
mathematically  that  a  certain  undesired  situation  can  never  arise  given  the  nature  of  the 
environment  and  the  control  system,  or  that  when  a  triggering  event  occurs  a  response  is 
always  generated  within  a  certain  time  period.  Results  of  this  kind  can  be  very  powerful  and 
can  give  us  great  confidence  in  the  systems  we  design. 

Analytical  approaches  are  not  without  their  drawbacks,  however.  The  primary  drawback 
is  that  the  complexity  of  the  systems  being  modeled  gives  rise  to  intractable  mathematical 
models  that  often  resist  analysis.  In  an  attempt  to  render  the  models  tractable,  simplifica¬ 
tions  are  often  made  which  cause  the  models  to  diverge  from  reality  in  ways  that  undercut 
the  usefulness  for  building  the  model  in  the  first  place. 

In  defense  of  analytical  methods  it  should  be  pointed  out  that  there  is  no  way  to  insure 
against  bad  modeling  and  analysis.  Experience  shows,  however,  that  formal  models  are  of 
use  but  often  they  must  be  supplemented  by  other  techniques. 
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3.4.2  Experimental  Evaluation  Methods 


The  other  main  category  of  evaluation  technique  is  experimental.  In  this  approach,  evalua¬ 
tion  is  done  by  measuring  properties  of  particular  instances  of  runs  of  particular  systems  and 
drawing  certain  conclusions.  Experimental  approaches  can  be  further  subclassified  according 
to  whether  they  involve  (a)  the  real  control  system  embedded  in  the  real  environment,  (b) 
the  real  control  system  embedded  in  a  simulated  environment,  (c)  a  simulated  control  system 
in  a  simulated  environment,  or  (d)  some  hybrid  approach.  Real  systems  offer  the  obvious 
advantages  of  evaluating  against  reality,  but  they  are  often  cumbersome  or  even  unavailable, 
may  pose  unacceptable  risks,  etc.  In  the  simulation  approach  to  experimental  evaluation, 
the  environment  and  possibly  the  control  system  as  well  (e.g.,  if  we  are  using  a  conventional 
machine  to  simulate  a  massively  parallel  system  controlling  an  environment)  a  computer 
program  is  run  to  generate  samples  of  the  behavior  of  the  overall  system.  These  samples 
are  then  analyzed  empirically  to  provide  evidence  in  favor  of  certain  conclusions  regarding 
performance  of  the  real  system  in  the  real  environment. 

Experiments  provide  a  framework  for  inductive  inference  of  general  relations  between  ar¬ 
chitectures  and  real-time  performance  based  on  observations.  The  goal  is  to  discover  general 
relations  that  can  be  expected  to  hold  whenever  the  appropriate  conditions  hold.  For  exam¬ 
ple,  one  relation  might  be  that  architecture  A  provides  graceful  degradation  in  performance 
under  increasing  rates  of  environmental  events;  another  relation  might  be  that  architecture 
B  produces  unacceptable  performance  degradation  under  similar  circumstances.  Because 
it  is  infeasible  to  make  observations  of  all  of  the  instances  encompassed  by  a  hypothesized 
relation,  it  becomes  necessary  to  draw  conclusions  about  what  would  happen  for  all  such 
instances  based  on  observation  of  a  few  particular  instances.  For  example,  we  might  draw 
conclusions  about  the  relative  advantages  of  architectures  A  and  B  on  the  basis  of  their 
performance  on  a  small  number  of  environmental  scenarios. 

Drawing  general  conclusions  from  a  small  number  of  observed  instances  is  a  risky  busi¬ 
ness.  A  given  S  realizes  an  abstract  architecture,  A ,  in  a  particular  implementation,  /,  and 
instantiates  it  for  a  body  of  knowledge,  K .  Architectures  themselves  are  complex  artifacts, 
differing  in  both  theoretically  interesting  variables  (e.g.,  knowledge  representation,  inference 
procedures,  control  mechanism)  and  incidental  variables  (e.g.,  implementation  details,  ex¬ 
ecution  environment).  Similarly,  different  environmental  scenarios  differ  in  a  great  many 
variables  (e.g.,  frequency  of  important  events,  distribution  of  deadlines,  amount  of  interpre¬ 
tation  required,  predictability  of  events).  As  a  consequence,  any  given  observation  of  the 
performance  of  a  given  architecture  on  a  given  environmental  scenario  is  likely  to  reflect  the 
combined  effects  of  many  such  variables. 

Controlled  experimentation  is  an  attempt  to  reduce  as  much  as  possible  the  incidental 
variability  in  a  set  of  observations  in  order  to:  (a)  obtain  a  reliable  account  of  particular 
effects  of  a  particular  set  of  theoretically  interesting  variables;  and  (b)  rigorously  bound 
the  class  of  situations  in  which  those  effects  can  be  expected  to  obtain.  In  a  controlled 
experiment,  one  or  more  “independent  variables”  are  manipulated  and  their  distinctive  effects 
on  one  or  more  “dependent”  variables  are  measured.  In  the  present  context,  we  will  typically 
be  manipulating  independent  variables  representing  a  proposed  architecture  within  S  and 
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measuring  dependent  variables  representing  the  performance  of  interest  within  E.  Other 
variables,  including  both  S  and  E  variables,  are  “controlled”  to  avoid  confounding  their 
effects  with  those  of  the  independent  variables.  We  also  will  often  evaluate  the  performance 
of  a  fixed  S  as  a  function  of  various  E  scenarios;  for  this,  we  keep  5  fixed  and  the  dependent 
and  independent  variables  are  in  E. 

Some  controlled  variables  are  simply  held  constant,  while  others  are  systematically  ma¬ 
nipulated  or  randomly  sampled  to  provide  a  basis  for  generalization.  In  the  domain  of  IRTPS 
systems,  there  are  a  variety  of  important  variables  we  may  wish  to  control: 

S  variables: 

•  sensors 

•  effectors 

•  knowledge  available 

•  computing  resources 

•  responsibilities 

E  variables: 

•  domain 

•  rate  of  critical/non-critical  events 

•  distribution  of  deadlines 

•  complexity  of  events  (e.g.,  multi-variate,  temporal  properties,  noisy,  uncertain) 

•  complexity  of  required  effects  of  actions 

•  complexity  of  reasoning  required 

•  knowledge  required 

•  tasks  required 

In  general,  the  more  thoroughly  variables  are  sampled  within  a  class,  the  more  reliably 
we  can  generalize  conclusions  based  on  associated  observations.  Statistical  inference  tech¬ 
niques  permit  probabilistic  statements  about  the  likelihood  that  relations  observed  in  a  given 
number  and  distribution  of  instances  will  hold  for  the  entire  class  of  such  instances. 

For  example,  to  compare  two  different  control  mechanisms,  A  and  B,  we  might  set  up 
two  complete  agent  architectures  differing  only  on  this  single  independent  variable,  while 
holding  constant  all  other  architectural  variables.  In  effect,  we  would  be  placing  the  S  —  E 
boundary  tightly  around  the  control  mechanism.  We  might  then  apply  the  architectures  to 
a  set  of  scenarios  in  which  we  hold  constant  the  environmental  domain  (e.g.,  power  plant 
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monitoring)  and  systematically  manipulate  the  frequencies  of  critical  and  non-critical  events 
and  the  associated  distributions  of  deadlines.  In  each  condition  (unique  combination  of  values 
of  independent  and  controlled  variables),  we  would  measure  several  dependent  variables,  such 
as  logical  correctness  of  response  and  satisfaction  of  deadlines  for  critical  and  non-critical 
events.  Given  an  appropriate  statistical  analysis  of  the  results  of  these  measurements,  we 
might  draw  certain  conclusions  with  high  confidence,  for  example:  (a)  for  critical  events  in  all 
conditions,  control  mechanism  A  produces  a  higher  rate  of  correct  answers  within  deadline 
than  control  mechanism  B  (97%  vs.  75%);  (b)  for  non-critical  events  in  all  conditions, 
control  mechanism  B  produces  a  higher  rate  of  correct  answers  within  deadline  than  control 
mechanism  A  (75%  vs.  65%);  (c)  as  the  frequency  of  events  increases  (1— >50  events  per  unit 
time),  control  mechanism  A's  performance  on  critical  events  degrades  slowly  (100— *95%), 
while  its  performance  on  non-critical  events  declines  dramatically  90 — >40%);  and  (d)  as  the 
frequency  of  events  increases,  control  mechanism  B's  performance  declines  significantly  for 
both  critical  and  non-critical  events  (95— *50%  in  both  cases).  Given  a  particular  set  of  utility 
functions-in  particular,  valuing  critical  events  more  highly  than  non-critical  events-we  might 
conclude  from  these  results  that  control  mechanism  A  is  “better”  than  control  mechanism 
B  because  it  provides  “better”  performance  overall  and  a  “better”  degradation  profile. 

Control  of  variables  determines  what  kinds  of  conclusions  an  experiment  can  support. 
For  example,  observing  the  performance  of  S\  and  S 2  on  a  single  scenario,  Oj,  in  a  single 
environment,  E\ ,  permits  only  conclusions  about  the  comparative  effectiveness  of  S\  and 
S2  on  scenario  0\.  Observing  performance  on  a  representative  sample  of  scenarios  in  E\ 
permits  conclusions  about  the  comparative  effectiveness  of  St  and  S2  in  environment  Et. 
Observing  performance  on  a  sample  of  scenarios  in  a  sample  of  environments  from  class  E 
permits  conclusions  about  the  comparative  effectiveness  of  S i  and  S2  in  environment  class 
E. 

Of  course  some  variables  are  quite  difficult  to  control,  and  these  necessarily  limit  the 
conclusions  that  can  be  drawn  from  experiments.  In  particular,  it  is  difficult  to  separate  and 
control  variables  that  distinguish  among  the  architecture,  implementation,  and  knowledge 
of  a  given  system  S.  Nonetheless,  if  we  wish  to  draw  strong  conclusions  about  the  utility 
of  an  architecture,  we  must  control  these  potentially  confounding  variables.  In  many  cases, 
our  experiments  may  permit  conclusions  only  about  the  level  of  performance  of  an  5  or  the 
relative  performances  of  alternative  5s.  It  may  require  analytical  methods-or  be  infeasible-to 
determine  whether  an  S' s  performance  advantage  is  due  to  its  architecture,  implementation, 
or  knowledge. 


3.5  Other  Implications 

Although  the  primary  purpose  of  this  document  concerns  methodologies  for  evaluating 
IRTPS  architectures,  these  considerations  also  carry  implications  regarding  requirements 
specification  and  experimental  testbed. 
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3.5.1  Implications  for  Requirements  Specification 

At  this  early  stage  of  IRTPS  research,  we  have  many  proposals  for  IRTPS  requirements  that 
use  idiosyncratic,  but  overlapping  vocabularies  to  describe  concepts  whose  relationships  to 
one  another  are  ambiguous.  The  proposed  model  of  embedded  systems  offers  an  opportunity 
to  operationalize  intuitive  concepts  like  reactivity,  coherence,  interruptability,  etc.  in  terms  of 
basic  measurements  on  state  values.  Thus,  it  will  become  clear,  for  example,  that  researcher 
A  uses  the  term  “interruptability”  to  refer  to  the  latency  to  respond  to  a  critical  event,  while 
researcher  B  uses  the  same  term  to  refer  to  the  ability  to  abort  an  ongoing  computation. 
Although  this  does  not  insure  agreement  on  terms  and  definitions,  it  provides  a  “lingua 
franca”  for  communication  about  terms  and  definitions. 

The  concept  of  an  5  —  E  boundary  is  fundamental  to  the  proposed  model  of  embedded 
systems.  Although  placement  of  the  boundary  is  flexible,  to  allow  study  of  IRTPSs  of 
differing  scopes,  a  given  placement  of  the  boundary  structures  the  definition  of  requirements 
and  assessment  of  their  satisfaction.  Thus,  on  one  side  of  the  boundary,  we  have  the  5  whose 
“requirements”  constitute  a  theory  or  design  that  is  hypothesized  or  intended  to  produce 
satisfactory  consequences  in  E.  On  the  other  side  of  the  boundary,  we  have  the  E  whose 
“requirements”  define  the  so-called  satisfactory  consequences.  An  informative  experiment 
will  tell  us  to  what  degree  the  requirements  hypothesized  for  5  (and  realized  in  a  particular 
implementation)  actually  achieve  the  requirements  demanded  for  E. 

For  example,  we  call  systems  “reactive”  in  (at  least)  two  different  situations.  First,  we 
speak  of  reactive  systems  that  iterate  a  highly  efficient  sense-act  loop.  Second,  we  speak 
of  systems  as  reactive  if  they  react  promptly  to  important  external  events.  Under  the  pro¬ 
posed  model  of  embedded  systems,  the  first  sense  of  reactive  is  a  hypothesized  requirement 
on  5,  5- Reactivity  while  the  second  is  a  defined  requirement  on  E,  call  it  E- Reactivity. 
The  testable  claim  is  that  5-Reactivity  produces  E-Reactivity  (and  perhaps  some  other 
desirable  E- Requirements  as  well).  Notice,  however,  that  other  testable  claims  are  possi¬ 
ble,  for  example  that  a  non-reactive  5  (one  whose  architecture  is  something  different  from 
the  above-mentioned  sense-act  loop)  also  produces  E-Reactivity  (and  perhaps  some  other 
desirable  E- Requirements  as  well).  The  purpose  of  experiments  is  to  evaluate  such  claims. 

Accordingly,  the  model  of  embedded  systems  suggests  that  efforts  to  specify  requirements 
for  IRTF)Ss  clearly  distinguish  between  5- Requirements  and  ^-Requirements.  In  particular, 
E- Requirements  should  be  defined  strictly  in  terms  of  measurements  on  E  variables,  while 
5- Requirements  should  be  defined  in  terms  of  measurements  on  5  variables.  For  example,  5- 
Reactivity  might  be  defined  as  a  bound  on  the  computation  performed  between  sensing  and 
acting.  E-Reactivity  might  be  defined  as  a  bound  on  the  latency  between  occurrence  of  an 
important  “problem”  event  and  the  occurrence  of  an  appropriate  external  “correction”  event. 
Moreover,  a  given  experiment  requires  agreement  among  participants  on  the  E-Requirements 
against  which  alternative  5-Requirements  will  be  evaluated. 
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3.6  Implications  for  an  Experimental  Testbed 


As  discussed  throughout  this  document,  IRTPSs  are  complex  artifacts  embedded  in  complex 
environments.  Experiments  that  allow  generalization  of  conclusions  beyond  the  immediate 
experimental  conditions  require  control  of  many  variables  in  both  the  S  and  the  E.  In 
addition,  experimentation  on  S' s  of  differing  scopes  requires  flexibility  in  the  placement  of 
the  S  —  E  boundary.  To  support  these  kinds  of  experimentation  requires  a  sophisticated 
testbed. 

For  example,  a  basic  testbed  would  allow  one  with  an  S  to  run  and  experiment  with  it. 
The  testbed  provides  an  E  (likely  simulated,  and  also  perhaps  paremeterized)  and  defines 
the  S  —  E  boundary  by  the  interface  functions  and  data  structures  through  which  S  and 
E  interact.  The  testbed  should  also  provide  means  for  controlling  multiple  runs  and  for 
collecting  measurements  on  E  and,  perhaps,  on  S  as  well.  Ideally  a  testbed  also  provides 
utilities  for  analyzing  the  results  (i.e.,  the  measurements)  across  multiple  runs. 

A  more  general  testbed  facility  would  not  have  the  E  built  in,  but  would  instead  accept 
both  the  E  and  the  S  as  inputs.  That  is,  it  defines  a  generic  interface  between  any  E  and 
any  S  compatible  with  that  E.  It  would  accomplish  this  by  defining  an  interface  between 
the  testbed  itself  and  S,  and  between  the  testbed  and  E.  These  interfaces  would  allow 
the  testbed  to  control  each  of  S  and  E ,  to  support  their  interactions,  and  to  collect  the 
measurements. 

For  still  more  flexibility,  a  testbed  would  support  experiments  with  varying  boundaries 
between  S  and  E.  That  is,  the  experimenter  would  supply  a  complete,  modular  system  to 
the  testbed  and  specify,  for  any  run,  where  the  S  —  E  boundary  is.  This  requires  a  more 
extended  interface  definition  facility  -  one  which  supports  a  complex  of  interacting  modules, 
preferably  composable,  and  allows  for  measurements  on  any  of  their  interfaces.  Indeed,  we 
can  generalize  this  concept  so  that  the  S  —  E  boundary  is  not  even  fixed  for  a  run,  but 
shows  up  only  in  terms  of  which  measurements  are  designated  the  utility  measurements. 
This  reflects  the  idea  that  the  S  and  E  together  are  a  complete  system,  and  specifying  the 
boundary  is  only  a  means  for  analysis  and  evaluation. 
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Chapter  4 

Report  on  Research  Issues  Session  of 
IRTPS  Workshop 


4.1  Introduction 

The  working  group  on  research  issues  was  given  the  following  set  of  tasks: 

1.  Propose  IRTPS-related  research  areas. 

2.  Consider  specific  research  topics  relevant  to  IRTPS  within  each  area. 

3.  Estimate  each  topic’s  potential  value  to  IRTPS  and  its  difficulty. 

4.  Identify  research  areas  and  topics  less  suitable  for  important,  near-term  IRTPS  re¬ 
search. 

Our  deliberation  was  to  include  non-AI  research  areas,  and  we  were  asked  to  list  research 
areas  in  other  disciplines  (both  within  and  outside  of  computer  science)  that  are  important  to 
IRTPS  and  suggest  the  most  important  IRTPS-related  research  topics  within  these  non-AI 
areas. 

We  began  by  considering  the  following  four-stage  model  of  how  research  on  a  major 
problem  like  IRTPS  might  progress: 

1.  Paradigms  (originally  labelled  “models”) 

2.  Theoretical  questions 

3.  Algorithm  development 

4.  Technology  base  (tools  and  design  principles) 
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Although  the  discussants  generally  felt  that  all  of  these  stages  are  important  to  IRTPS, 
the  discussion  focused  primarily  on  the  first  two  stages  and  the  relationships  between  them. 

The  discussion  of  this  model  of  scientific  progress  produced  the  following: 

1.  Instead  of  “paradigms”  the  term  “models”  was  originally  used  for  the  first  stage.  How¬ 
ever,  subsequent  discussion  revealed  that  we  had  a  more  general  range  of  activities 
in  mind  than  the  descriptive  activity  of  modeling.  Paradigms  entail  broad  commit¬ 
ments  to  (a)  prioritization  of  issues  to  be  addressed,  (b)  a  general  vocabulary  and 
set  of  constraints  on  its  use  in  describing  the  phenomena  of  interest,  and  (c)  pre¬ 
ferred  methodology  for  assessing  alternative  descriptions  (predictions)  and  techniques 
produced  within  (i.e.,  consistent  with)  the  paradigm.  Therefore,  commitment  to  a 
paradigm  constrains  its  user  both  conceptually  and  methodologically  in  explorations 
of  some  set  of  phenomena.  Initial  activity  at  this  stage  centers  around  very  informal 
distinctions  and  constructs.  For  quite  some  time  there  may  be  many,  equally  plausi¬ 
ble,  candidate  paradigms.  As  a  field  matures  the  predominant  paradigms  are  likely 
to  yield  formalized  theory  languages  within  which  detailed  models  can  be  specified. 
Prioritization  of  research  issues  is  also  significantly  sharpened,  and  the  methodology 
for  measuring  progress  is  also  formalized. 

2.  The  role  of  “theoretical  questions”  was  less  clear.  Some  participants  in  our  discus¬ 
sion  emphasized  how  paradigms  enable  the  formulation  and  examination  of  theoretical 
questions.  In  this  sense,  maturation  of  a  paradigm  logically  precedes  the  theoretical 
examination  used  to  refine  that  paradigm.  Other  participants  emphasized  the  many 
important  theoretical  questions  that  must  be  asked  and  answered  in  order  to  formulate 
a  paradigm  in  the  first  place.  Such  theoretical  questions  may  be  “pre-paradigmatic,”  or 
even  “non-paradigmatic.”  In  this  alternative  sense,  examination  of  certain  theoretical 
questions  logically  precedes  articulation  of  a  paradigm. 

3.  Both  of  these  views  are  valid.  They  provide  complementary  accounts  of  the  relationship 
of  theory  development  to  analysis  and  experimentation.  By  combining  these  accounts 
one  comes  to  see  scientific  progress  as  a  complex,  evolutionary  process  rather  than  a 
simple  sequential  ordering  of  stages. 

4.  The  theoretical  questions  that  may  be  asked  at  the  pre-paradigmatic  stage  are  more 
likely  to  focus  on  broad  issues  and  to  be  only  informally  stated.  However,  these  informal 
questions  play  a  very  important  role  in  the  early  evolution  of  a  scientific  discipline  by 
helping  to  articulate  and  prioritize  the  concepts  and  methodology  on  which  some  more 
formal  paradigm  will  stand. 

5.  For  computationally  oriented  topics  such  as  IRTPS,  algorithm  development  (stage  3) 
and  technology  creation  (stage  4)  play  an  essential  role  as  conclusive  analytic  and 
empirical  tests,  respectively,  of  the  concepts  and  methods  produced  by  a  paradigm. 


This  broad  discussion  of  scientific  progress  provided  us  a  useful  context  within  which  to 
focus  more  specifically  on  the  IRTPS  initiative: 
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1.  IRTPS  research  is  at  an  early  stage  of  evolution.  Consequently,  it  is  important  to  sup¬ 
port  careful,  though  informal,  examination  of  concepts  and  issues  that  may  point  the 
way  toward  choosing  among  competing  paradigms  or  creating  a  new,  more  appropriate 
paradigm.  There  are  many  candidates  for  the  role  as  a  dominant  IRTPS  paradigm. 
These  need  to  be  better  understood  and  evaluated. 

2.  IRTPS  researchers  would  benefit  greatly  from  the  existence  of  “paradigm  problems,” 
simplified  problem  instances  whose  features  exemplify  critical  and  broadly  occurring 
features  of  the  full  range  of  IRTPS  problems.  For  example,  the  well-known  Sussman 
anomaly  within  the  M.I.T.  “blocks  worlds”  serves  as  a  paradigm  problem  that  has 
stimulated  much  work  within  AI  on  reasoning  about  action.  IRTPS  needs  its  own 
stock  of  paradigm  problems.  However,  it  is  not  likely  that  one  may  successfully  set  out 
to  formulate  a  paradigm  problem.  A  problem  achieves  this  status  only  if  it  becomes  a 
widely  studied  problem,  whose  interpretation  is  generally  agreed  to. 

In  keeping  with  our  discussion  of  the  evolution  of  research,  we  attempted  to  generate  plau¬ 
sible  paradigms  (models)  around  which  IRTPS  research  could  be  organized.  The  next  section 
presents  the  results  of  this  analysis.  We  then  exploited  our  knowledge  of  these  paradigms 
to  generate  candidate  research  areas  upon  which  effort  might  be  focused  in  the  IRTPS  ini¬ 
tiative,  to  refine  topics  within  these  these  research  areas,  and  evaluate  their  significance  for 
IRTPS. 


4.2  Candidate  IRTPS  Paradigms 

Challenging  concepts  underlie  the  idea  of  an  IRTPS  system.  Scientists  are  far  from  agreeing 
on  the  meaning  of  the  concept  of  an  “intelligent  system”  (whether  computational  or  not). 
Nor  are  they  ready  to  commit  to  a  common  understanding  of  what  “problem  solving”  entails. 
Even  the  term  “real-time”  tends  to  generate  vigorous  debate.  A  number  of  disciplines  suggest 
alternative  perspectives  and  approaches  to  the  development  of  IRTPS  systems.  Some  of  the 
more  prominent  candidates  include  the  following: 

4.2.1  Planning 

This  paradigm  represents  the  AI  mainstream.  Problem-solving  is  conceived  as  goal-driven 
reasoning  about  action,  planning,  and  “meta-planning.”  Advocates  of  this  paradigm  expect 
that  further  research  will  enable  them  to  extend  its  basic  concepts  and  techniques  so  that 
these  methods  exhibit  real-time  performance. 


4.2.2  Logics 

This  paradigm  is  popular  at  Stanford’s  Center  for  the  Study  of  Language  and  Information, 
among  other  places.  It  proposes  to  use  special  extensions  of  classical  first-order  predicate 
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calculus  to  formally  describe  problem-solving,  and  now  IRTPS,  in  terms  of  the  interactions 
of  an  agent’s  beliefs,  desires,  and  intentions.  One  set  of  logic-based  approaches  involves  ex¬ 
tending  classical  logic  with  modal  operators,  axioms,  and  inference  rules  intended  to  capture 
the  “logic”  of  reasoning  with  beliefs,  desires,  etc.  Another  approach  involves  adding  mecha¬ 
nisms  for  so-called  “non-monotonic  reasoning”  that  allow  inference  with  assertions  that  may 
under  some  circumstances  be  retracted. 


4.2.3  Algebraic  Techniques 

This  paradigm  was  suggested  by  one  participant  in  our  discussion  as  an  alternative  to  logic- 
based  approaches.  The  same  goals  hold  but,  in  this  paradigm,  one  uses  and  extends  such 
formal  constructs  as  universal  algebra  and  similar  formal  systems.  These  algebraic  tech¬ 
niques,  rather  than  predicate  logic,  provide  the  underlying  formal  system. 


4.2.4  Decision  Theory 

This  paradigm  views  problem-solving  as  choice  among  alternative  actions  on  the  basis  of  pref¬ 
erence.  In  this  view,  problem-solving  is  represented  as  choosing  those  actions  that  maximize 
a  problem-solver’s  subjective  utility.  It  puts  forward  the  principle  of  maximizing  (subjective) 
expected  utility  as  a  “gold  standard”  for  characterizing  ideal  rational  action.  Advocates  of 
this  paradigm  suggest  that  real-time  problem-solving  involves  making  rational  choices  while 
also  taking  into  account  the  costs  of  thinking  (i.e.,  computation)  and  of  missing  deadlines. 
IRTPS  is  thus  conceived  of  as  entailing  a  process  to  manage  scarce  problem-solving  resources 
such  as  time  and  information.  Decision  theory  is  sufficiently  mature  as  a  paradigm  to  have 
well-tested  mathematical  methods  for  explicating  its  view.  The  formal  methods  of  deci¬ 
sion  theory  enable  the  explicit  modeling  of  the  problem-solver’s  probabilistic  uncertainty,  its 
preferences,  and  its  attitude  towards  risk. 


4.2.5  Control  Theory 

This  paradigm  has  culminated  in  a  mature  academic  discipline  that  has  produced  a  large 
amount  of  practical  technology  for  building  real-time  systems.  Modern  control  systems 
manage  complex  distributed  physical  processes,  and  often  do  so  while  meeting  real-time  per¬ 
formance  constraints.  Control  theory  has  also  produced  practical  technology  for  building 
learning  systems.  Modern  control  theorists  often  conceive  of  an  advanced  control  system 
as  a  non-trivial  embodiment  of  intelligent  (or  at  least  intelligently  created)  problem-solving 
processes.  Control  theory  and  decision  theory  are  closely  related.  Indeed,  the  theoretical 
language  of  decision  theory  can  be  used  to  describe  many  of  control  theory’s  concepts  and 
formal  constructs.  However  these  paradigms  differ  in  some  important  ways.  In  particular, 
applications  of  control-theoretic  methods  often  model  a  “closed  loop”  relationship  between 
the  actions  of  a  controlling  system  and  the  controlled  system  that  it  manages.  By  contrast, 
the  formal  methods  of  decision  theory  make  it  difficult  to  model  feedback.  Secondly,  ad¬ 
vanced  control  theories  have  been  developed  that  emphasize  specifiying  a  control  response 
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by  analyzing  the  feedback  and  other  observations  of  the  “plant”  (the  controlled  system)  in 
terms  of  a  reference  model  available  to  the  controlling  system.  Model- reference  methods  can 
emphasize  qualitative  issues  that  have  received  less  attention  in  decision  theory.  Examples 
include  use  of  a  reference  model  that  only  approximately  models  the  actual  dynamics  of  the 
plant  and  methods  for  replacing  a  state-based  model  with  one  that  has  fewer  states  but  which 
can  still  be  used  to  meet  the  original  control  criterion  (i.e.,  so-called  state-aggregation  meth¬ 
ods).  In  sum,  although  control  theory  and  decision  theory  are  closely  related  paradigms,  they 
provide  significantly  different  emphases  and  may  even  turn  out  to  be  formally  incompatible. 

4.2.6  State-Based  Automata  Methods 

One  discussant  proposed  state-based  automata  methods  as  an  alternative  paradigm.  The 
principal  distinction  of  this  paradigm  seems  to  lie  in  its  use  of  the  mathematics  of  automata 
theory  for  formal  modeling  and  analysis.  There  may  also  be  some  increased  emphasis  upon 
the  control  of  non-linear  systems  in  this  approach,  although  this  difference  from  standard 
control  theory  has  diminished  in  recent  years.  In  fact,  many  state-based  methods  are  well- 
known  to  control  theorists,  and  many  of  the  standard  problems  defined  within  the  state-based 
paradigm  are  similar  or  identical  to  the  standard  problems  emphasized  by  control  theorists, 
e.g.,  reachability,  identifiability,  stability,  convergence,  and  controllability. 

4.2.7  No-Paradigm 

Several  of  the  participants  pointed  out  that  IRTPS  (and  AI  in  general)  is  arguably  at  a  very 
early,  pre-paradigmatic  stage.  No  existing  paradigm  obviously  provides  explicit  coverage 
of  all  IRTPS  issues.  In  particular,  no  existing  approach  can  readily  claim  to  have  been 
formulated  explicitly  with  IRTPS  in  mind.  So,  we  should  not  attach  ourselves  to  one  of 
these  existing  paradigms  with  the  illusion  that  it  provides  a  suitable  basis  for  investigating 
IRTPS.  Given  that  this  is  the  case,  there  are  at  least  three  alternative  courses  of  action: 

1.  Do  something  to  develop  a  new  paradigm  from  scratch.  Unfortunately,  it  is  difficult  to 
imagine  how  to  guide  this  process,  or  even  to  suggest  how  work  of  this  type  could  be 
evaluated  or  supported.  Most  likely,  one  can  judge  the  value  of  a  new  paradigm  only 
in  retrospect. 

2.  Compare  alternative  existing  paradigms  and  then  adopt  the  best  ideas  from  the  clos¬ 
est  or  some  combination.  This  variant  probably  describes  the  approach  of  many  Al 
researchers  in  the  history  of  that  field.  The  paradigm  labelled  “Logics”  above  may 
exemplify  this  to  a  certain  extent. 

3.  Look  for  some  deeper,  more  foundational  paradigm  within  which  IRTPS  is  a  “special 
case.”  This  alternative  is  strongly  reductionist  and  presumes  that  one  has  a  coherent 
sense  of  the  dimension  of  reduction  and  a  convincing  argument  for  the  acceptability  of 
the  chosen  paradigm  for  the  underlying  level.  As  an  example,  some  AI  researchers  have 
attempted  to  rely  upon  work  at  the  foundation  of  mathematics  as  a  deeper  conceptual 
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and  formal  basis  for  their  approach  to  AI  problems.  Prima  facie,  this  strategy  may 
be  problematic  for  IRTPS  because  of  foundational  difficulties  in  modeling  notions  of 
process  and  time. 


4.3  Toward  an  IRTPS  Research  Agenda 

Our  discussion  group  contained  advocates  of  all  the  candidate  paradigms  discussed  above. 
We  exploited  these  diverse  perspectives  in  generating  a  set  of  candidate  IRTPS  research 
areas. 

4.3.1  Research  Areas 

After  some  discussion  and  re-organization,  our  group’s  participants  agreed  on  a  rough  outline 
of  the  areas  and  sub-areas  within  which  they  expect  to  find  research  problems  that  are  critical 
to  IRTPS.  They  distinguished  these  areas  as  being  either  issues  of  resource-constrained 
reasoning  or  problems  of  modeling  a  real-time  environment.  The  subsequent  discussion  of 
topics  (reviewed  in  the  next  section)  focused  only  on  the  former  (with  some  rearrangement.) 

The  following  outlines  critical  IRTPS-related  research  areas: 

1.  Problem-solving  under  resource  constraints 

(a)  Limited  resource  reasoning 

i.  Controlling  focus  of  attention 

ii.  Hierarchy  of  reflection 

iii.  Temporal  reasoning 

iv.  “Anytime”  reasoning 

(b)  Managing  varying  resources  at  “runtime” 

(c)  Managing  competing  objectives 

(d)  Reasoning  about  resources 

2.  Modeling  features  of  a  complex,  dynamic  environment 

(a)  Uncertainty  and  unpredictability 

(b)  Time-dependent  events  and  action  outcomes 

4.3.2  Prioritization 

After  settling  on  this  preliminary  categorization  of  IRTPS  research  issues,  our  afternoon 
session  focused  on  two  subsidiary  goals:  First,  we  attempted  to  identify  fairly  specific  research 
questions  within  each  of  the  broad  areas  that  had  been  discussed  in  the  morning  and  second, 
we  attempted  to  evaluate  each  of  these  research  questions  with  regard  to  three  properties: 
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1.  How  likely  was  it  that  progress  would  be  made  in  this  specific  area  in  the  next  two 
years? 

2.  How  important  to  the  IRTPS  venture  would  this  progress  be? 

3.  How  “easy”  was  the  problem?  This  term  was  not  defined  further,  but  seemed  to 
measure  the  group’s  collective  confidence  that  progress  predicted  in  (1)  would  in  fact 
be  achieved. 


Of  course,  all  of  the  decisions  made  about  these  questions  were  entirely  subjective,  and 
should  be  taken  principally  as  reflecting  the  opinions  and  biases  of  the  research  issues  working 
group. 


Short-Term,  Important  Issues 

The  following  research  problems  were  felt  to  be  important  ones  on  which  progress  could  be 
expected  in  the  near  term: 

Compilation  The  problems  here  are  those  of  preprocessing  information  so  that  it  can  be 
accessed  more  quickly  when  it  is  needed,  or  transforming  one  representation  of  a  process 
to  another,  more  efficient,  representation.  The  techniques  mentioned  included  inductive 
methods,  the  synthesis  of  causal  theories,  case-based  reasoning,  and  neural  networks.  This 
is  such  an  active  research  area  already  that  it  was  felt  certain  that  progress  would  indeed  be 
made  over  the  next  two  years,  and  compilation  was  therefore  labelled  “easy.” 


Resource  Representation  and  Monitoring  What  problems  are  involved  in  representing 
the  fact  that  IRTPS  systems  can  be  expected  to  encounter  resource  limitations?  What  needs 
to  be  done  to  make  it  practical  for  these  systems  to  monitor  their  resource  consumption  and 
needs?  The  first  of  these  was  felt  to  be  easy;  the  second,  less  so. 


Time-Dependent  Utility  How  can  an  agent  expect  the  utility  of  achieving  certain  goals 
to  vary  over  time?  Not  easy. 


Metareasoning  Applied  to  Planning  and  to  Search  Applying  some  sort  of  metalevel 
reasoning  to  control  search,  to  determine  when  replanning  is  needed,  or  to  weigh  competing 
subgoals.  Easy. 


Sensory  Planning  Planning  to  acquire  new  information  through  the  use  of  sensors.  Easy. 

Plan  Monitoring  Given  that  a  plan  has  been  constructed  to  achieve  some  real-time  goal, 
how  can  progress  be  monitored  as  the  plan  is  executed?  Not  easy. 
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Anytime  Reasoning  Here,  we  identified  a  variety  of  subproblems  that  could  be  lumped 
under  the  title  of  “anytime  inference:” 

1.  Redefining  inference  in  a  way  that  allows  the  inference  process  to  be  interrupted,  and 
constructing  algorithms  that  capture  this  new  definition. 

2.  Constructing  measures  on  the  value  of  an  incomplete  answer  to  a  declarative  query. 

3.  Defining  a  notion  of  an  “approximate”  answer  to  a  declarative  query.  (Not  short-term.) 

4.  Anytime  planning,  by  which  we  mean  the  development  of  planning  systems  that  work 
by  progressively  improving/refining  partial  plans. 

None  of  these  problems  was  felt  to  be  easy. 


The  Application  of  Decision  Theory  to  the  Problem  of  Focusing  Attention  What 
should  the  agent  concentrate  on  next?  Which  of  its  sensors  should  be  polled  at  any  particular 
time?  Not  easy. 

Finally,  we  included  in  this  group  of  problems  that  of  investigating  the  foundations  of 
intelligent,  real-time  problem  solving.  It  was  felt  that  progress  would  inevitably  be  made 
here. 


Important  Issues,  But  Not  Short-Term 

These  problems  were  generally  felt  to  be  harder  than  those  in  the  previous  section: 

1.  Resource  planning.  Planning  to  obtain  more  resources. 

2.  Process  representation.  Understanding  and  representing  the  time-dependent  processes 
with  which  IRTPS  systems  will  need  to  interact. 

3.  Utility  of  metareasoning  and  learning.  Given  that  we  reason  about  our  own  problem¬ 
solving  activity  and  that  we  learn,  under  what  circumstances  are  these  mental  activities 
useful  ones? 

4.  Understanding  and  being  able  to  deal  with  competing  goals. 

5.  Understanding  dynamic  focus  of  attention.  How  is  it  that  an  agent’s  focus  of  attention 
changes  from  time  to  time?  What  should  control  this  process? 

6.  Understanding  partial  or  incomplete  plans;  debugging  incorrect  plans. 
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Not  Important 

Finally,  several  research  areas  were  identified  that  were  felt  to  be  of  lesser  importance  to  the 
IRTPS  effort.  In  some  cases,  this  was  because  these  problems  were  not  felt  to  lie  within  the 
IRTPS  domain  specifically;  in  others,  the  problems  simply  failed  to  generate  any  enthusiasm 
within  the  research  issues  working  group. 

1.  Learning  metalevel  information  by  induction. 

2.  Caching.  By  this  we  mean  the  study  of  data  structures  and  mechanisms  that  could  be 
used  to  save  previously  computed  results. 

3.  Metalevel  reasoning  applied  to  scheduling,  to  situation  understanding,  and  to  proba¬ 
bilistic  computation. 

4.  The  application  of  operating  systems  concepts  (interrupts,  priorities,  etc.)  to  the 
problem  of  focusing  attention. 

5.  Defining  the  concept  of  a  plan. 

6.  Planning  for  goals  that  are  partially  satisfiable  or  temporally  scoped  (such  as  that  of 
maintaining  some  external  condition).  This  and  the  previous  topic  were  felt  to  be  the 
domain  of  the  planning  community  proper  and  outside  the  scope  of  IRTPS  per  se. 

4.4  Concluding  Remarks 

The  working  group  was  aware  that  the  background  and  interests  of  its  members  may  not 
have  been  fully  representative  of  the  IRTPS  community  as  a  whole  and  may  have  introduced 
certain  biases  into  its  conclusions.  For  example,  the  lack  of  representation  of  more  researchers 
with  strong  applications  experience  probably  limited  our  ability  to  generate  a  comprehensive 
list  of  the  “right”  questions.  We  were  also  restricted  in  our  ability  to  propose  paradigms  be¬ 
cause  important,  non-AI  disciplines  were  under-represented.  The  “AI  planning”  contingent, 
on  the  other  hand,  was  proportionately  over-represented,  and  as  a  result,  our  discussion  of 
research  topics  tended  to  be  biased  towards  issues  within  that  paradigm.  These  observations 
about  the  composition  of  the  group  are  offered  not  to  diminish  the  validity  of  our  conclusions, 
but  rather  to  place  them  in  their  appropriate  context. 
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Chapter  5 

Annotated  Bibliography 


The  following  is  a  bibliography  containing  a  representative  collection  of  references  to  litera¬ 
ture  relevant  to  intelligent  real-time  problem-solving.  Each  item  has  been  annotated  with  a 
brief  description  of  its  content  and  its  relation  to  the  field. 
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